Extreme values for two-dimensional discrete Gaussian free field 
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Abstract 

We consider in this paper the collection of near maxima of the discrete, two dimensional 
Gaussian free field in a box with Dirichlet boundary conditions. We provide a rough description 
of the geometry of the set of near maxima, estimates on the gap between the two largest 
maxima, and an estimate for the right tail up to a multiplicative constant on the law of the 
centered maximum. 



1 Introduction 

The discrete Gaussian free field (GFF) {rj^ : v G Vjv} on a 2D box of side length with 

■ Dirichlet boundary condition, is a mean zero Gaussian process which takes the value on OVn 
\^ , and satisfies the following Markov field condition for all v G Vn \ OVn- f]^ is distributed as a 
tJ" I Gaussian variable with variance 1 and mean equal to the average over the neighbors given the GFF 

on Vn\{v} (see later for more formal definitions). One facet of the GFF that has received intensive 
. attention is the behavior of its maximum. In this paper, we prove a number of results involving the 

^ I maximum and near maxima of the GFF. Our first result concerns the geometry of the set of near 

■ maxima and states that the vertices of large values are either close to or far away from each other. 

Theorem 1.1. There exists an absolute constant c > 0, 
/\ . lim lim ¥{3v, u G Vn : r ^ \v — u\ ^ N/r and r]^ ,1]^ ^ — clog log r) = , (1) 

where = EmaXt,gVjv Vv ■ 

(The asymptotic behavior of tun is recalled in @ below.) In addition, we show that the number 
of particles within distance A from the maximum grows exponentially. 

Theorem 1.2. For A > 0, let Ajsf^x = {v G Vn '■ rj^ ^ rn^ — A} for A > 0. Then there exist 
absolute constants c, C such that 

lim lim P(ce''^ < |yliv,A| < Ce^^) = 1 . 

A— >cxD N^oo 



•Partially supported by NSF grant DMS-1313596. 

^Partially supported by NSF grants DMS-0804133 and DMS-1106627, a grant from the Israel Science Foundation, 
and the Herman P. Taubman chair of Mathematics at the Weizmann institute. 



1 



Another important characterization of the joint behavior for the near maxima is the spacings of 
the ordered statistics, out of which the gap between the largest two values is of particular interest. 
We show that the right tail of this gap is of Gaussian type, as well as that the gap is of order 1. 

Theorem 1.3. LetT^ be the gap between the largest and the second largest values in {r]y : v G V^}. 
Then, there exist absolute constants c, C > such that for all X > and all N €N 

ce"^^' ^ P(r^ ^ A) ^ Ce-^^' , (2) 
IimIimsupP(rAr ^ (5) = 0. (3) 

<5-5-0 7V_s>oo 

We do not know whether the gap estimate in ([2]) can be improved to a precise Gaussian tail estimate 
as A — )• CO. 

Finally, we compute the right tail for the maximum up to a multiplicative constant. Set M^v = 

N 

maxi, % . 

Theorem 1.4. There exists a constant C > such that for all A G [1, ^log N), 

Related work. In the mathematical literature, the study on the maximum of the GFF goes back 
at least to Bolthausen, Deuschel and Giacomin [T2] who established the law of large numbers for 
Mjv/ log by associating with the GFF an appropriate branching structure. Afterwards, the main 
focus has shifted to the study of fluctuations of the maximum. Using hypercontractivity estimates, 
Chatterjee [H] showed that the variance of the maximum is o(logn), thus demonstrating a better 
concentration than that guaranteed by the Borell-Sudakov-Tsirelson isoperimetric inequality, which 
is however still weaker than the correct 0(1) behavior. Later, Bolthausen, Deuschel and Zeitouni 
[13] proved that (M„— EM„) is tight along a deterministic subsequence (nfc)fcgpj; they further showed 
that in order to get rid of the susbequence, it suffices to compute a precise estimate (up to additive 
constant) on the expectation of the maximum. An estimate in such precision was then achieved 
by Bramson and Zeitouni [17J, by comparing the GFF with the modified branching random walk 
(MBRW) introduced therein. They showed that the sequence of random variables M^r — mjv is 
tight, where 

m^ = 2y27^(logA-|loglogAr)+0(l). (4) 

Using the "sprinkling method" , this was later improved by Ding [23j , who showed that there exist 
absolute constants C, c > so that for ah A^ e N and ^ A ^ (log A)^/^ 

ce"^^ ^ F{Mn ^ mjv + A) ^ Cq-"^ , and ce"^^^^ ^ ¥{Mn ^ mjv - A) ^ Ce-""'^ . (5) 

Note that our Theorem 11.41 gives an improvement upon the estimates on the right tail of A/^v in 
([5]); the precise estimate for the left tail is more challenging because an analogous comparison to 
Lemma 12.61 for the left tail could not be achieved in an obvious way. 

In contrast with the research activity concerning the maximum of the GFF, not much has been 
done until very recently concerning its near maxima. To our knowledge, the only results in the 
mathematical literature is due to Daviaud [20] who studied the geometry of the set of large values 
of the GFF which are within a multiplicative constant from the expected maximum, i.e. those 
values above r/m^v with rj S (0, 1). He showed that the logarithm of the cardinality of the set of 
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such near maxima is asymptotic to 2(1 — rj) log N, and described the fractal structure of these sets. 
Related work for the continuous GFF is contained in Hu, Miller and Peres [31j . 

In contrast with the GFF, much more is known concerning both the value of the maximum 
and the structure of near maxima for the model of branching Brownian motions. The study 
of the maximum of the BBM dates back to a classical paper by Kolmogorov, Petrovskii, and 
Piscounov [33], where they discussed the KPP equation (also known as the Fisher equation). The 
probabilistic interpretation of the KPP equation in terms of BBM, described in McKean [37], was 
further exploited by Bramson [14^ [TS] . It was then proved that both the left and right tails exhibit 
exponential decay and the precise exponents were computed. See, e.g., Bramson [15] and Harris 
|30j for the right tail, and see Arguin, Bovier and Kistler [8] for the left tail (the argument is due to 
De Lellis). In addition, Lalley and Sellke [3l] obtained an integral representation for the limiting 
law of the centered maximum. 

More recently, the structure of the point process of maxima of the BBM was described in great 
detail, in a series of papers by Arguin, Bovier and Kistler [SI [9l [7] and in a paper by Aidekon, 
Berestycki, Brunei and Shi [4]. These papers describe the limit of the process of extremes of BBM, 
as a certain Poisson point process with exponential density where each atom is decorated by an 
independent copy of an auxiliary point process. 

In the physics literature, the link between the extremal process for the GFF and that for BBM 
is often assumed, and some aspects of it (such as tail distributions) are conjectured to hold for a 
general class of logarithmicaly correlated processes, see Carpentier and Le Doussal [18] for (non- 
rigorous) arguments using a renormalization-group approach and links to the freezing transition of 
spin-glass systems, and Fyodorv, Le Doussal, and Rosso [29] for further information on extreme 
distributions. On the mathematical side, numerous results and conjectures have been formulated 
for such models; see Duplantier, Rhodes, Sheffield, and Vargas [26j and Arguin and Zindy [lOj for 
recent progress. 

Our results in this work are a first step in the study of the process of extrema for the GFF. In 
particular. Theorem 11.11 is a precise analog of results in [8], while Theorems 11.21 and 11.31 provide 
weaker results than those of [9] . The results here play an important role in the study of convergence 
of the maximum of the GFF, see Bramson, Ding and Zeitouni [16j. 

Finally, we note that a connection between the maximum of the GFF and the cover time for 
the random walk has been shown in Ding, Lee and Peres [:25] and Ding |22j . In particular, an 
upper bound on the fluctuation of the cover time for 2D lattice was shown in [24] using such a 
connection, improving on previous work of Dembo, Peres, Rosen and Zeitouni [21]. It is worthwhile 
emphasizing that the precise estimate on the expectation of the maximum of the GFF in pT] plays 
a crucial role in [24j. 

A word on proof strategy. A general approach in the study of the maximum of the GFF, 
which we also follow, is to compare the maxima of the GFF and of Gaussian processes of relative 
amenable structures; this is typically achieved using comparison theorems for Gaussian processes 
(see Lemmas 12.21 and 12. Sp . The first natural "comparable" process is the branching random walk 
(BRW) which admits a natural tree structure (although [12] do not use directly Gaussian compar- 
isons, the BRW features implicitly in their approach). In p/Tj, the modified branching random walk 
(see Subsection 12. ip was introduced as a finer approximation of the GFF, based on which a precise 
(up to additive constant) estimate on the expectation of the maximum was achieved. 

Our work also uses comparisons of the GFF with the MBRW/BRW. One obstacle we have 
to address is the lack of effective, direct comparisons for the collection of near maxima of two 
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Gaussian processes. We get around this issue by comparing a certain functional of the GFF, 
which could be written as the maximum of a certain associated Gaussian process. Various such 
comparisons between the GFF and the MBRW/BRW are employed in Section [2j In particular, we 
use a variant of Slepian's inequality that allows one to compare the sum of the m-largest values for 
two Gaussian processes. Afterwards, estimates for the aforementioned functionals of MBRW/BRW 
are computed in Section [3l Finally, based on the estimates of these functionals of the GFF (obtained 
via comparison) , we deduce our main theorems in Section HI 

Along the way, another method that was used often is the so-called sprinkling method, which 
in our case could be seen as a two-level structure. The sprinkling method was developed by Ajtai, 
Komlos and Szemeredi [5j in the study of percolation, and found its applications later in that area 
(see, e.g., [6l[ll]). Under the framework of the sprinkling method, one first tries to understand a 
perturbed version of the targeted random structure, building upon which one then tries to establish 
properties of the targeted random structure. Such a scheme will be useful if a weak property on 
the perturbed random structure can be strengthened significantly to the targeted structure with 
relatively little effort by taking advantage of the perturbation. In the context of the study of the 
maximum of the GFF, the sprinkling method was first successfully applied in [23]; an application 
to the study of cover times of random walks appears in [22j . 

Discussions and open problems. There are a number of natural open problems in this line of 
research on the GFF, of which establishing the limiting law of the maximum and the scaling limit 
of the extreme process are of great intereslll]. Even partial progresses toward these goals could be 
interesting. For instance, it would be of interest to provide more information on the joint behavior 
of the maxima by characterizing other important statistics. We also point out that we computed 
the exponent only for the right tail as in Theorem 11.41 but not for the left tail. A conceptual 
difficulty in computing the exponent in the left tail is that the MBRW has Gaussian type left tail 
(analogous to BRW) as opposed to doubly-exponential tail in ([5|) — the top levels in the MBRW 
could shift the value of the whole process to the left with a Gaussian type cost in probability, 
while in the GFF the Dirichlet boundary condition decouples the GFF near the boundary such 
that the GFF behaves almost independently close to the boundary. Therefore, it is possible that 
a new approximation needs to be introduced in order to study the left tail of the maximum in 
higher precision (merely using the sprinkling method as done in j22] seems unlikely to yield even 
the exponent). We note that these last comments do not apply to the so-called massive GFF, see 
Chatterjee [19], which is expected to behave analogously to BBM. 

Three perspectives of Gaussian free field. A quick way to rigourously define the GFF is to 
give its probability density function. Denoting by / the p.d.f. of (?],,), we have 

/((x^)) = Ze~T6 T.^^.{^u-x.? ^ (g) 

where Z is a normalizing constant and = for v G dV^- (Note that each edge appears twice in 
©•) 

Alternatively, a slower but more informative way to define the GFF is by using the connection 
with random walks (in particular. Green functions). Consider a connected graph G = (y,E). For 
U C V, the Green function Gu{-, •) of the discrete Laplacian is given by 

Gu{x, y) = E^{Zll-H{Sk = y}) , for all X G F \ [/, y G F , Gu{x, y) = for x G ?7, y G F , (7) 

^The convergence of the law of the recentered maximum has been recently proved [16], using results of the current 
paper 
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where tjj is the hitting time of the set U for a simple random walk (Sk), defined by (the notation 
applies throughout the paper) 

Tu = mm{k ^ : Sfc G [/} . (8) 

The GFF {ry„ : v G V} with Dirichlet boundary on U is then defined as the mean zero Gaussian 
process indexed by V such that the covariance matrix is given by Green function {Gu{x,y))x,y<^v ■ 
Clearly, = for all v £ U. For the equivalence of definitions in ([6]) and ([7]), c.f., [32j. 

Finally, we recall the connection between the GFF and electrical networks. We can view the 
2D box V/v as an electrical network if each edge is replaced by a unit resistor and the boundary 
is wired together. We then associate a classic quantity to the network, the so-called effective 
resistance, which is denoted by -Reff(") ■)• The following well-known identity relates the GFF to the 
electric network, see, e.g., [32l Theorem 9.20]. 

EilJu-Vvf = ^Re{iiu,v). (9) 

Note that the factor of 4 above is due to the non-standard normalization we are using in the 2D 
lattice (in general, this factor is 1 with a standard normalization). 

Acknowledgement. We thank two anonymous referees and the editor for useful comments on an 
earlier version of the manuscript. We thank Vincent Vargas for pointing out an annoying typo in 
an earlier version of the paper. 

2 Comparisons with modified branching random walk 

In this section, we compare the maxima of the Gaussian free field with those of the so-called 
modified branching random walk (MBRW) , which was introduced in [T7] ; the advantage of dealing 
with the MBRW is that its covariance function, like that of the GFF but in contrast with BRW, 
depends on the Euclidean distance, as we explain next. 

2.1 A short review on MBRW 

Consider N = 2" for some positive integer n. For k £ [n], let be the collection of squared boxes 
in of side length 2^ with corners in Z^, and let BT^k denote the subsets of Bk consisting of squares 
of the form ([0, 2^ - 1] nZ)^ + {i2^,j2^). For v € I?, let Bk{v) = {B £ Bk : v £ B] he the collection 
of boxes in Bk that contains f , and define BVk{v) be the (unique) box in BVk that contains v. 
Furthermore, denote by B^ the subset of Bk consisting of boxes whose lower left corners are in V^. 
Let {ak,B}k'^o,BeB'Dk be i.i.d. standard Gaussian variables, and define the branching random walk 
to be 

'^v = Y2=0(^k,BVk{v) ■ (10) 

For k S [n] and B € B^ , let hk^B be independent centered Gaussian variables with \ai{hk,B) = 2"^^^, 
and define 

hlB = hk,B',ioiB^NB' £B^ , (11) 

where B B' if and only if there exist i,j E Z such that B = {iN,jN) + B' (note that for 
any B € Bk, there exists a unique B' G B^ such that B '^]\f B'). In a manner compatible with 
definition in (jlip . we let d^{u,v) = min^^j^y \\u — w\\ be the Euclidean distance between u and v 
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when considering Vn as a torus, for all n, u G V^. Finally, we define the MBRW {$,^ : v G Vat} 
such that 

= Ylk=oYyB£Bkiv)^k,B ■ (12) 

The motivation of the above definition is that the MBRW approximates the GFF with high preci- 
sion. That is to say, the covariance structure of the MBRW approximates that of the GFF well. 
This is elaborated in the next lemma which compares their covariances (see \17\ Lemma 2.2] for a 
proof). 

Lemma 2.1. There exists a constant C such that the following holds with N = 2"' for all n. 

|Cov(e^,e,^) - (n - log2(d^(n,i;)))| ^ C, for all u,v (^Vn , 
|Cov(r?^^,r/^^)- 2M2(n-(0Vlog2||n-^;||))| ^ C, for all u,v e {2N,2N) + Vn ■ 

2.2 Comparison of the maximal sum over restricted pairs 

In this subsection, we approximate the GFF by the MBRW based on the following comparison 
theorem on the expected maximum of Gaussian process (See e.g., |28j for a proof). 

Lemma 2.2 (Sudakov-Fernique). Let A be an arbitrary finite index set and let {Xa}aeA o,nd 
{Ya}a<^A be two centered Gaussian processes such that 

EiXa - Xhf ^ E(y, - Y^f , for alla,b£A. (13) 

Then E maXa£_4 ^ E maXa(=_A_ Ya- 

Instead of directly comparing the expected maximum as in |17j . we compare the following two 
functionals for GFF and MBRW respectively. For an integer r, define 

7?^,, = maxjr/^ + r]^ : u,v £ Vat, r ^ \\u - v\\ ^ N/r} , (14) 
eN,r = max{C„^ + : „ g y^, ^ ^ ||^ _ ^|| ^ N/r} . 

The main goal in this subsection is to prove the following. 

Proposition 2.3. There exists a constant k G N such that for all r, n ^ k positive integers, and 
with N = 2", 

In order to prove the preceding proposition, it is convenient to consider 

VN,r = max{77^^(27V,27V) + ^t+(2N,2N) ■u,v£VN,r \\u - v\\ < N/r} . (15) 
We start with proving the next useful lemma. 
Lemma 2.4. Using the above notation, we have 

(a) P(max„gy^ ??f ^ A) ^ 2P(max^,gyj^ ^t,+{2N,2N) ^ ^) f'^'^ A G M. 
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Proof. Denote by Vj^ = {v + {2N,2N) : v G Viv}, and consider the process r]^ as indexed over 
the set (so it is a Gaussian free field on V/v with index shifted by {2N,2N)). Note that the 
conditional covariance matrix of {rit^}vevif given the values of {^/^^}„gV4jv\v^ corresponds to the 
covariance matrix of {r]^}veVl^- This implies that 

{r?r -.VGV;,} {v^ + E(ryr I ■-uGV.^X F^}) : v G V^} , (16) 

where on the right hand side {rj^ : v G V^} is independent of {?7^^ : u G V47V \ V^} and both are 
defined on the same probability space. Write 

cP, = E(ryr I ■■uGV.nX V^,}) = E^'' \ {r/^^ : u G dVl,}). 

Note that 0^ is a linear combination of {r/.^^ : u G 5V^}, and thus a mean zero Gaussian variable. 
By the above identity in law and the independence, we derive that 



,0 

,r 1 



where {ti,T2) is the pair at which the sum in the definition of 7?^^, is maximized (see (I14|) and 
(jlSp ). and the second equality follows from the fact that (pn and (j)r2 has mean 0. This completes 
the proof of Part (i). Part (ii) follows from the same argument, by noting that almost surely. 



max rjf, ^ max(r/^ )' + 



where r G is the maximizer for {r]y^ : v G V^} and , for v G Vat, (??^)' = flt+{2N2N) ~ 
E(r/„_|_(2Ar,27V) I {^u^)"" S dV^) is distributed like a GFF. The desired bound follows from the fact 
that (pr given the location r is a centered Gaussian variable independent of (max^evw Vv)' • '-' 

Proof of Proposition 12.31 For the upper bound, by the preceding lemma, it suffices to prove 
that E^^^ ^ E^2''Arr- ^^"^ ^^^^ purpose, define the mapping ipN '■ Vn ^ y2"-N by 

^n{v) = {2''~'^N, 2''~^N) + 2^-''^v , ioiveVN. (17) 

Applying Lemma 12. H we obtain that there exists sufficiently large k (that depends only on the 
universal constant C in Lemma l2.ip such that for all v,u,v',u' G Vn, 

TU't 4:N , 47V 4N AN \2 

'^\Vv+{2N,2N) ~^ Vu+{2N,2N) %'+{2N,2N) ^u'+{2N,2N)> 

In order to verify p8|) , note from the definition that the variance of i'^^^y^ grows with k (more 
levels are involved) while, for all u, w G Vat, the covariance between and does not grow 

(the number of common levels remains constant, since points are taken farther away due to the 
definition of V'Af)- This observation allows us to select k large so as to increase the right hand side 
in (jlSp . Now, an application of Lemma 12.21 for the processes 

{^^+(27V,2iV) + vl+(2N,2N) -u^veVN^rf^ \\v - u\\ N / r} , 
and {^^(Cm + eZu)) ■■u,veVN,r^\\v-u\\^ N/r] 
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yields that Ef?^,, ^ y^^^C2-N.r- Here we used the fact that r ^ Wi'Niv) - iJNiu)\\ ^ 2''N/r for 
ah u,v £ Vjy such that r ^ ||f — u\\ ^ N/r. 

The lower bound follows along the same line, which we now sketch. Analogous to (jlSp . we can 
derive that for all u,v,u',v' G 1^2-«Ar 

^l^Va-KjvM +^^2-«ivW "^'Aa-KivK) ~^^2-iv(«')^ ^ ^ ^^^^ + ^« ~ ~ > ' 

Combined with the fact that r ^ ||V'2-«Af(^) ~ V'2-'=Af(^)ll ^ -^7'' foi" "^-iV £ ^2-«Af such that 
r ^ ||u — I'll ^ jr^ another application of Lemma 12.21 completes the proof. 

2.3 Comparison of the right tail for the maximum 

In this subsection, we compare the maximum of GFF with that of MBRW in the sense of "stochastic 
domination" , for which we will use Slepian's [38] comparison lemma. 

Lemma 2.5 (Slepian). Let A be an arbitrary finite index set and let {Xa}aeA '^^^ {^a}aeA ^6 
two centered Gaussian processes such that (jl3p holds and VarX^ = Vary^ for all a £ A. Then 
¥{ma.XaeAXa ^ X) > ¥{ma.Xa^j^Ya > A), for all A G M. 

Remark. The additional assumption on the identical variance in Lemma 12.51 allows for a compar- 
ison of maxima of fields that goes beyond comparisons of expectations. On the down side, it forces 
us to modify our fields before we can apply the lemma. 
The main result of this subsection is the following. 

Lemma 2.6. There exists a universal integer k > such that for all N and A G M 

ip(max„ey^_^y^er"^ ^ A) ^ P(max,ev^r?f ^ A) ^ 4P(max,ey,., v^^'""^ ^ ^) ' 

Proof. We first prove the upper bound in the comparison. In light of Part (ii) of Lemma 12. 4| it 
suffices to consider the maximum of GFF in a smaller central box (of half size), with the convenience 
that the variance is almost uniform therein. Indeed, by Lemma 12. H we see that for a universal 
constant C > 

I Var rjt^ - Var r/^^| < C , for all u,v e {2N, 2N) + Vn ■ (19) 

Let V'Af be defined as in (fT7|) . It is clear that for k sufficiently large (independent of A^), we have 
Var 77^^j.2jY 2Ar) ^ ^ ^ Var C^*^^,;) for all v G Va^- Therefore, we can choose a collection of positive 
numbers {a^^veVN such that 

Var (r?,t(2^,2iV) + «.^) = ^ Var ^J^) , (20) 

where X is an independent standard Gaussian variable. Furthermore, due to (jl9p and the fact that 
the MBRW has precisely uniform variance over all vertices, we have for a universal constant C > 

\au — av\ ^ C , for all u,v G Vn ■ 

This implies that 

^iivt+(2N,2N)+"-vX)-{r]f/l^2N,2N) + '^uX)f ^ Hivt+i2N,2N)-^t+{2N,2N)f + ' for "U, V G Vat • 
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Combined with the fact that E(^^'^^^ ~ ^'^nIv) ')'^ grows (Unearly) with k and Lemma [2. 11 it fohows 
that for K sufficiently large (independent of N) and for all u,v £ V/v 

E(Wi,+(2JV,2iV) + (^v^) - iVu+(2N,2N) + ^ -^^i^^Pi^iu) " ^M^V ' ^^^^ 

Combined with (I20p , an application of Lemma 12.51 yields that 

F{max,^Vr,vt+i2N,2N) + avX ^ A) ^ w{^f^max,^vAi7{v) > ^) ' ^r aU A G M . (22) 
It is clear that 

P(max^eVjv'?^+(2Ar,2Ar) + "'^X ^ A) ^ P(max^ev'jv'?^+{2Af,2Af) ^ A,X ^ O) 

= ip(max^6y^r?^^(2iV,2JV) ^ ^) ■ 

Combined with (j22p . the desired upper bound follows. 

We now turn to the proof of the lower bound, which shares the same spirit with the proof of the 
upper bound. Recall the definition of -02 

pT|) . Using Lemma ETT] again, we obtain that 
I Var - Var K C , for all u,v e Fa-TV • 

It is also clear from Lemma l2.ll that Var ^^^^ ^ 2 log 2 ^2 '^N^ ^ sufficiently large (in- 

dependent of A^) and for all v S V2-K7V Continue to denote by X an independent standard 
Gaussian variable. We can then choose a collection of positive numbers {a'^ : v £ V2-k^} satisfying 
Wv ~ ^'u\ ^ ^ such that 

Var = ^ Var(er'^^ + a,X) , for ah v G V^-.^ . 

Analogous to the derivation of (pTj) . we get that for k sufficiently large (independent of A^), 

IE((<„.^(.) - ^ ^]E((d"^^ + <X) - (d^^^ + a^Xyf , for aU u, v G V^-.^ ■ 

Another application of Lemma 12.51 yields that for all A G M 

P(max,gv^^_^r?^^_^(,) ^ A) ^ Fi^/^rm.^^^v.^^X^-^'' + ^'^^) ^ ^) 

^ P(V^max.ey^_.,er'^^ ^ A,X ^ 0) 
= M^/^^--^.v.-.jr'' > A) . 
Combined with the fact that V'2-'=Af(^) ^ for all v G V2-K7V! this completes the proof. □ 

2.4 Comparison of the maxima of sums of particles 

We conclude this section with a comparison between the Gaussian free field and branching random 
walk, which will be used in the proof of Theorem 11.21 
We need the following variant of Slepian's inequality. 
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Lemma 2.7. Let X = [Xi : i G [n]) and Y = (li, . . . ,y„) 6e too mean-zero Gaussian processes 
such that ¥.Xf = EF.^ and 'EXiXj ^ "^YiYj for all i,j G [n]. Fix 1 < m ^ n, and define 

Proo/. For /3 > 0, define : M by 

F/3(x)=/3-ilog 5^ e'^''^ 

where we denote by Qm = {A CI [n] : |^| = m} and = J2i£A ^i- prove below that 

^'^Fp/^x^^Xj ^ , i^j. (23) 
Then, by |36[ Theorem 3.11], one has that 

EFp{y.) ^ EFp{Y) . 

Takmg /3 — )• oo yields the lemma. 

It remains to prove (|23p . For k E [n] and / C [n], we set = {-B C [n] \ / : \B\ = k}. Then, 
for i / j. 



The inequality (I23p follows from the following combinatorial claim. □ 
Claim 2.8. For all i,j,m G [n] and /? > 0, we have 



Proof. Fix a sequence (ai,...,o„) such that G {0,1,2} for all I £ {0,1} and 

Ef = 2m — 2. We count the multiplicity of the term in the left (denoted by L) 

and right hand sides (denoted by ii), respectively. Let k = |{^ G [n] \ {i,j} : = 1}|. It is 
straightforward to verify that 



L 



(fc/2+i) , if Oi + Oj = , ( (^^2) ' if a* + Oj = , 



(fe^ 1)72) ' if flj + = 1 , and R = < 
I, \{k~2)/2) ' if + = 2 ; 



U/2; 

((fc-l)/2) ' if + = 1 . 
({fc-2)/2) ' if o-i + fflj = 2 . 



Therefore, we always have L ^ R, completing the proof of the claim. □ 

We now demonstrate a comparison for the maxima of sums of values between the GFF and the 
BRW. 

Lemma 2.9. For N = 2^ with n G N, let {rj^ : v G Vat} be the Gaussian free field and {-d^ : v G V/v} 
the branching random walk as defined in (llOp . For ^ G N, define 



Si^N = max{Et,eA^f ■.\A\=i,Ac Vn} , and TZi^n = \J max{E,,eA'^t' : |^| = ^, A C Vat} 
Then, there exists absolute constant k G N such that E5^ ^ ^ E7^^ Ar2'^ . 
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Proof. Consider t?* = t?„ + kX„ where are i.i.d. standard Gaussian variables, and define TZ*^ ^ = 
y/2 log 2/7r max{^^g^ 'dl : \A\ = e,A C Vn}- Clearly, E7^^^^ ^ ^Re,N2-- Let X be another 
independent standard Gaussian variable and choose a non-negative sequence {a^ : f G (2A^, 2A^) + 
V/v} such that 

Var(7?^^ + a^X) = Var , for aU v G (2iV, 2iV) + Vn ■ (24) 
By Lemma \2.1\ we see that la^ — ai,| ^ C for an absolute constant C > 0. Further define 

5;^^ = ^^^{T.veA+{2N,2N)Vt^ + (^vX : 1^1 = £,yl C Vat} . 

Using similar arguments as in the proof of Lemma 12.4^ we deduce that EiS^^tv ^ E5|^. Therefore, 
it remains to prove E5|^ ^ E7^^^. To this end, note that we can select k = 4(7 such that for all 

^{K'&u) ^ ^{{'nt+{2N,2N) + ^v+{2N,2N)X){vt+{2N,2N) + a,,+(2Af,2Ar)-'^)) • 

Combined with (p^ and Lemma [2771 it completes the proof. □ 

3 Maxima of the modified branching random walk 

This section is devoted to the study of the maxima of MBRW, from which we will deduce properties 
for the maxima of GFF. 

3.1 The maximal sum over pairs 

The following lemma is the key to controlling the maximum over pairs. Set ttt-tv = Y^vr/2 log 2 • niN- 
Lemma 3.1. There exist constants ci,C2 > so that 

2rhN — C2 log log r ^ E^^ ^ 2mjv — ci log log r . 

We consider first a branching random walk {^f : i = 1,... ,4"}, with four descendants per 
particle and standard normal increments. Note that {"d^ : v £ Vn} as defined in (jlOp is a BRW 
with four descendants per particle and n generations. We use different notation in this subsection 
that allows us to ignore the geometrical embedding of the BRW into the two-dimensional lattice. 
Let Tn be the maximum of the BRW after n generations. Let c* = 2-v/log 2, c = (3/2) /c* and 
tn = c*n — c log n. We need the following estimates on the right tail of the maximum of a BRW. For 
the lower bound, we refer e.g. to [2j and to [39', (2. 5. 11), (2. 5. 13)]. One can obtain the upper bound 
by adapting, with some effort, Bramson's argument in [15]; this is done in detail in [161 Lemma 
3.7, 3.8]. Alternatively, one can refer to [3l Prop. 4.1] for most of the content of Lemma |3.2[ 

Lemma 3.2. The expectation ETn satisfies 

Er„ = c*n-clogn + 0(l). (25) 
Further, there exist constants c, C > so that, for y £ [0, \/n\, 

ce-'^'y ^ P(r„ ;^ t„ + y) ^ C(l + y)e-''*y , (26) 
with the upper bound holding for any y ^ 0. 
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We remark that [3", Prop. 4.1] implies (in a much more general setting than that considered 
here) a lower bound in (I26p that matches the upper bound (up to a multiplicative constant); we 
will not directly use this. Further, (j26|) implies that with an independent copy of T^, there exists 
a constant C such that 

P(r„ + T;, ^ 2tn + 2y) ^ Yl ^^'^^ ^ + [•?■' ^ + ^ + 2y - (j + 1)) 

[2(^-1)1 

^ 2F{Tn ^ tn + 2y) + HTn > tn + JMT;^ ^ tn + 2y - {j + 1)) 

j=0 

^ C{l + yfe~^^'y ^C{l + y)^e-^'''y (27) 

for any y ^ and any positive integer n. 
For x G Z, let 

H„(x) = #{1 ^ ^ ^ 4" : e [t„ - X - l,t„ - x]} 

be the number of particles in the BRW at distance roughly x behind the leader. The following is 
essentially folklore, we include a proof since we have not been able to find an appropriate reference. 

Proposition 3.3. For some universal constant C , and all x G Z, 

EH„(x) ^ C7ne"*^-^'/2n _ ^28) 



Further, for any u > —x so that < x + ti ^ y^n/2, 

P(H„ (x) ^ e'^* (^+") ) < Ce-^*"+^ . (29) 

Note that the interest in (j29p is only in situations in which x + u\s at most at logarithmic scale 
(in n). 



Proof. The estimate (I28p is a simple union bound: with G a zero mean Gaussian with variance n 
we have 

ES,(x) = 4"P(G G [t„ - X - l,t„ - x]) . 

Using standard estimates for the Gaussian distribution and the value of tn, the estimate ([28]) follows. 

We write the proof of (|29p in case x ^ 0, the general case is similar. We use Lemma 13.21 Fix 
5 > 0, r = 2(x + u)"^ and y = u — clogr. Note that clogr + y + x < y/r. With K an arbitrary 
positive integer, 



P(r„+, ^ tn+r + y)> IP(H„(x) -^K) I- {F{Tr^tr + clogr + y + x- clog(l + r/n))) 



I _ (l _ (jQ-c'iy+x+clogr) 



^ P(H„(x) ^ K) 



where in the last inequality we used the lower bound in 
^-c'(y+x+ciogr)j^ ^ ^ ^^^^ therefore 



K 



(30) 



Taking K = e'^*(^+") we have that 



F{Tn+r ^ tn+r + V) > cP(H„(x) ^ K) . 

Using the upper bound in (j26p we get that 

P(H,(x) ;^K) ^Ce-^*J'(l + y). 

This yields ([29]). 



□ 
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In what follows, we write i ~s j if the particles and Xj' had a common ancestor at generation 
n — s. In the next corollary, the precise value of the constants appearing in the exponent is of no 
particular significance (nor have we tried to optimize over those). 

Corollary 3.4. There exists a constant C > such that, for any s ^ n/2 positive integer, and any 
z positive, 

P(3ii ~, i2 : Xl[ + XI ^ 2tn - clog s + z)^ C[e-°-9"*" + e-0-45^*^-0-7i°s^] . (31) 

Similarly, 

P(3n i2 : XI + XI ^ 2tn - clog s + z) ^ C[e-°-9^*^ + e-0-45<=*-o.7iogs] _ (32) 

In particular, there exists an vq such that for all r > r^ and all n large, 

E max {Xl+Xl)^2tn-{c/A)\ogr. (33) 

n~s«2, se[r,n— r] 

Proof. We first provide the proof of ()3ip : the claim ()32p follows similarly and (I33p will then be an 
easy consequence. 

The argument (given the estimates in Proposition 13.31 is straightforward and routine, even if 
tedious; it requires controlling the number of particles, at generation n — s, that are near tn-s — x, 
that is En-six), and dividing to cases according to x and the different possible values of H„_s(x). 

In what follows we set u* = u*{x,z) = max(|a;|,z) and j* = j*{x,z) = \u*~\. We also define 
zL^^ =Z_n{x: |x| ^ (z + clogs)/2}, zL^^ =Z_n{x: > (z + clogs)/2} and Z„ = {x G Z : ^ 
X + n* ^ Y^n/4}. (For negative x one has to exercise some care, this is the reason for the definition 
of Zi^) and Zi^\) 

The starting point of the proof of (I3ip is the following estimate, obtained by decomposing over 
the location of particles at generation n — s. 

P(3ii ~, i2 : Xf^ + X,'; ^ 2tn -clogs + z) 
^^P(H„_,(x) ^e^*("+"*))+ 

Yl ^i^n~s{x) ^ e'=*("+^'))e^*("+^'+^¥(T, + ^ 2t, + z + 2x + clog+ s) + 

xez+n^n j=o 

Y Yl ^i^n-six) ^ e-^* )e^*("+^'+^)p(r, + T',^2ts + z + 2x + clog+ s)+ 

Y IE(H„_,(x))P(T, + ^2ts + z + 2x + clog+ s)+ 

Y IE(H„„,(x))P(T, + ^2ts + z + 2x + clog+ s) + Y ^i^n-s{x) > 1) 

=:YMx)+ Y Mx)+ Y Mx)+ Y Mx)+ Y Mx)+ Y ^6(2;) 
xez xez+n^„ xezL'^n^^ xez^^'n^ji x&+r^z- ^^^(2) 

= : A1+A2+A3 + A4 + A5 + AG, (34) 
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where is an independent copy of T^. The contribution to Ai from x G Zn can be estimated using 
()29|) and one finds 

oo 

Ai{x)f^CY, e-'=*^+^^°g+^ + 2C^e-'=*^'+'^^°g+^ ^ Ce^^°>^+^e-^*^ . (35) 



A similar computation using (p9|) and (p7j) yields 

It* 

^ A2(x) ^ C ^ ^e-'=*^'+^^°s+("+^')e^*("+^'+i)e-^*(^+2x+c-iogs)(^^|^|^-^^g^^4 

a;ez+n2„ xsz+n^n i=o 

^ C7(l + logs)^e^'°s+^e-c*^_ (36) 

To control A3, we repeat the last computation and obtain 

^ C(l + logs)^e^'°g+^e-^*^/2-c*5iogs/2^ (37) 

To control Aq over Zn, we repeat the estimate as in controlling Ai and obtain 

Y M^) ^ C{1 + log s + ^)e-^*-/2+c*5iog^/2 . (38) 

The estimate for x is easier, using this time (j28p . Indeed, in such a situation either or 
z are at least of order y/n. One has 

Y Mx) Y lEHn-s(x) • e-'=*(^+"*) < Y Cne-'^*"*-^'/" ^ ^-0.9c*z-2logn ^ 

(The constant 0.9 does not play a particular role in the last inequality, all that is needed is that 
it is smaller than 1 and close to 1 and that e"'''^'^*"* < for all n large and x Zn ) Since 

log s < log n we get 

^ yli(x) < Cs"V°■9"*^ (39) 

Similarly, 

Y Mx) Y + Z + X + clog s)4„e-"*(^'+")-^'/2n-c*Elogs ^ ^-0.9c'z _ (^q) 

x&z+nz:^ x&z+nz^ 

As mentioned earlier, for negative x E Z^ one has to exercise some care, this was the reason for 
the definition of Z^^ and One has, using ([2 



J]] A4(x)^(:7 ^ n(l + z + |x| + clogs) V"*(^+"+^^°s^) 

^ 0.45c*2— 0.99c*clogs ^ g—0.45c*2— 0.7 logs (^-|^) 
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where we have used that c*c = 3/2, and again the choice of 0.99 as the constant multiplying c*c is 
of no real importance except that it is close enough to 1. Finally, just using (I28p . we get similarly 

Summing ([35]) - (jl2]) yields ([3T]) . As mentioned before, the proof of ([32]) is similar. Because c*c = 3/2 
and 0.9 • 3/2 > 1 we also have then that 

F{3s e {r, . . . , n/2}, 3^ ~, is : Xf^ + Xf^ ^ 2t„ - (c/4) log r + z) 

n/2 

^ ^ (^|-g-0.9c*(z+clog(s/ri/-')) _|_ g-0.45c*(2+clog(s/r0-2^))-0.71ogsj ^ ^g-0.45c*z 
s=r 

A similar estimates holds for the range s £ {n/2, . . . ,n — r}. Summing those over z yields (|32p . 
We omit further details. □ 

We can now provide the 

Proof of Lemma 13.11 We begin with the upper bound. The argument is similar to what was 
done in the proofs in Section [2] and therefore we will not provide all details. 

Let 5^ be a BRW of depth n and set = {1 - eN)S^ + Gy where is a collection of i.i.d. 
zero mean Gaussians of variance cr^ to be defined (independent of N) and £n = 0(l/n). Choosing 
a and £n appropriately one can ensure that E{{R^)'^) = E{{^^)'^) and that E{{R^ — R^)'^ ^ 
^ii^u Applying Lemma [2. 71 and Corollary [331 we deduce the upper bound in Lemma [3.11 

We now turn to the proof of the lower bound. The first step is the following proposition. In 
what follows, 1^^^ is defined as .^^^ except that the maximum is taken only over pairs of vertices 
at distance at least N/4: from the boundary, and the top two levels of the MBRW are not added. 

Proposition 3.5. There exist constants Ci, C2 > such that for all N large and all r, 

P(a,r ^ 2m,v - Ci log log r) ^ C2 . (43) 

We postpone the proof of Proposition 13 . 51 and show how to deduce the lower bound in Lemma [3. II 
from it. Fix C = 2*^ > 1 integer and consider the MBRW ' in the box Vcn with levels up to 
n = log2(A^/4) (that is, the last c + 2 levels are not taken), and define cr ™ ^ natural way. By 
independence of the field in sub-boxes of side A^/4 that are at distance at least A^/2 of each other, 
we get that 

ttrrr ^ 2rhN - Gi log log r) ^ 1 - (1 - Glf"''' . 



Adding the missing c + 2 levels we then obtain, by standard estimates for the Gaussian distribution, 

F{ecN,r > "^rhM - Gi log log r - y) ^ 1 - (1 - C|)^'/2 - G^e'^^y"/'' . 
Renaming N, we rewrite the last estimate as 

¥{eN,r ^ '^rhN - Gi log log r - y - C5C) ^ 1 - (1 - Glf'l^ - Cge-^^J''/^ . 
Choosing y = G^c and summing over c we obtain that ^Cnt ^ ^rfiN — Gq log log r, as claimed. 
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Proof of Proposition 13.51 We consider Vn as being centered. There are two steps. 

Step 1 We consider the MBRW from level n — logr — 1 to level 1. That is, with r fixed define 



iv = E E h^B, and^„,, = VWAn(-Zj . (44) 



For each x G An^r, let Vjv,r(2^) denote the 7? box centered at x with side N/2r. We call y G A^^r 
a rig'/ii neighbor of x G An,r if = y2 and yi > xi satisfies yi = xi + N/r, and we write y = xr. 
Finally, we set, for x G Ajy^r, 

C* — 

iN,r,x — ™ax . 

Note that, by construction, the collection {Cn x}x<^A„,r is i.i.d. 
A straightforward adaptation of [IT] shows that 

^iCN,r,x > ^N/r " c) ^ g{c) , (45) 

where g{c) — )-c-s.oo 1 is independent of N,r. Let C* be the (unique) element of VN,r{x) such that 

Mn,t,c = {x G An^r ■ CN,r,x ^ "^-Af/r " C, ?Ar,r,a;_R ^ "^-Af/r " c} . 

By independence, we get from that there exists a constant c, independent of N,r, so that 

n\MN,r,c\ ^ r^/^) ^ ^ . (46) 

Step 2. For X G MN,r,c, set ^^^^ .j, = ?Ar,r,a;+^Af,r,a;fl! ^^^^^ SUch X, OUe has ^Ar,r-,x ^ 2mjv/r-2c. 

Define, for G Vat, 

n 

>;^= E E €b, (47) 

fc=n-log2 r BeSfe (d) 

and for x G ^Afr, set 

Conditioned on the sigma algebra TN,r generated by the collection of variables {Q n}^ collection 
{Z^}x is a zero mean Gaussian field, with (conditional) covariance satisfying 

mZ^Z^) - 4(log2 r - log2(|x - y\/{N/r))\ ^ C , 

for some constant C independent of N,r] here, E denotes expectation conditioned on J-N,r- 

It is then straightforward, using the argument in the proof of Proposition 5.2 in |17j . to verify 
that Z"^ = maxjjgMjv r c Z^ is comparable to twice the maximum of MBRW run for log2 r genera- 
tions, i.e. that on the event |M7v,r,c| ^ r^/4 there exist positive constants ci, C2 independent of r, N 
(but dependent on c) such that 

P(Z=V > 2m, - ci) ^ C2 , 

We now combine the two steps. Let x*^ be the (unique) random element of Mj\f^r,c such that 
= Z^^. Then, on the event \MN,r,c\ ^ ?'^/4, we have 

iN,r ^ + 2m^/, - 2c . 
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Therefore, with probabiHty at least g{c) -02, we get that 

1^ ,, ^ 2{mr + rhj^/r) - C4 ^ 2mN - C5 log log r , 

completing the proof of the proposition. □ 
Combined with Proposition 12.31 Lemma l3 . 1 1 immediately gives the following consequence. 

Corollary 3.6. There exist absolute constants ci,C2,C > so that 

2mj\f — C2 log log r — C ^ ^Vn r ^ 2m]y — c\ log log r + C . 

3.2 The right tail for the maximum 

In this subsection, we compute the right tail for the maximum of the MBRW. 
Lemma 3.7. There exists a constant C > such that for all y £ [1, y/n) and n large enough, 
^-iy^-2y^y ^ ip(max^^^ yrfiN + y)^ Cye-^'^y . 
Combined with Lemma 12.61 the preceding lemma directly yields Theorem 11.41 

Proof of Lemma 13.71 The upper bound is an immediate comparison argument. Consider the 
MBRW and consider the associated BRW As noted in [17, Prop. 3.2], E(^f )2 = E(^f )2 
and there exists a constant C such that for v ^ v' , 

E^c^ +0 Ee?"j . 

Let G,G^ be iid Gaussian variables of zero mean and variance C, independent of the fields {^,'^}. 
Set = + G and jl^ = +Gy. Clearly, it is still the case that E(//^)^ = E(/i^)^, while now, 

^ ^Uvl^v' . for t; / . 

We conclude from Slepian's lemma that 

P(max^/i^ ^ t) ^ P(max^/i^ ^ t) ^ ip(max^^^ ^ t) . 

(The last inequality because P(G ^ 0) = 1/2.) On the other hand, max^ is trivially stochasti- 
cally dominated by maXj; ^ . Combining these with the upper bound in (|26p yields the upper 
bound in the lemma. 



The main work goes to the proof of the lower bound. Recall that N = 2'^. Set a„ = 2-^log 2n — 
■■ log n. To simplify notation, we drop the superscript and denote by {^t, : u G F} a MBRW 



4Vlog2 

of n levels. For ^ t ^ n, let ^v{t) be the sum of the Gaussians variables in the first t-levels for 
(i.e., summing over the Gaussian variables associated to boxes of side length 2", 2"~^, . . . , 2"~*). 
Define 

My) = {iv G [an + y-l,an + y],iv{t) ^ ^+yVt G [n]}, and Z{y) = ^ l^Jy) . 

Therefore, writing $,v{t) = iv{t) — ^ we can compute 

r{A,{y)) = FiUn) G [y - l,y],Ut) ^ y for all t G [n]) . 
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Let Q be a probability measure under which is a Gaussian random walk. Then we have 



= e n 2^" . (48) 



Altogether, we obtain that 



P(^,(y)) = Eq( — = e-2ne- « ^Q(^,(y)) 

where the notation x means that the ratio of the left and right hand sides is bounded above and 
below by absolute positive constants. Note that we have applied the Ballot theorem (see, e.g., [B 
Theorem 1]) to estimate Q(A^(2/)). This implies that 

EZ(y) X e-2v^yy . (49) 

Next we turn to computing the second moment of Z{\j). To this end, consider v and w such that 
V and w splits in level ig = n — s (denoted by ?; ~s w). That is to say, the boxes of side length 2* 
associated to v are disjoint from those associated to w. Write Ci,(t) = ^i;(i) — C«>(*) = ^i«(*)~if 
We compute (writing an = an/n) 

=nUt) ^ y,iUt) ^ y for all t G [nlUn),i^{n) G [y - l,y]) 

= Y^¥{Ut) ^ y,^w{t) ^ y for all t e [n],Un),^M G [y-l,y],Uts) G [2- 1,^]) 
<Y.^iUt)^y, for alHG [t,],e.(t.) G [z - 1, z])r2_,_, , (50) 

where 

ry,^,s = sup P(?i,(0 ^ y for all ts <t ^ n,^^{n) G [y - l,y] | ^v{ts))- 



Note that in (I50p . we have an inequality as opposed to an equality which would hold for BRW. For 
V ~s w, the processes {^v{t) '■ t £ [ts]} and {Cw{t) : t £ tg} are not precisely the same and therefore 

{Cv{t) ^ y, for all t G [ts],Uts) e[z- l,z]} / {e«,(t) ^ y, for all t G [t.],^"^(t«) G [z - 1, z]} . 

This explains the inequality in ()50p . By the Ballot theorem, 

Tj;,.,, ^ P(e.(0 ^ y - z for all r G [s],C".(s) G [y - 1 - ^, y - ^]) < ( ^"3^/2^ V > (51) 

where the notation < means that the left hand side is bounded by the right hand side up to an 
absolute constant. Recalling (I48|) and applying a slight variation of the Ballot theorem (see, e.g., 
[391 Corollary 2]), we obtain that 

FiUt) ^ y, for all t G [ts],Uts) e [z - l,z]) < e-#("-^)e-4-V-"^e-2""fe-^) yi^^i±ll . 

(n — sj-^/^ 
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Plugging the preceding inequality and ([51]) into (|50|) . we get that 

F(A,.) n < „ . , |;e-#e-4.e-"^.»e--|^ < ^'-''^^J:;;:'- . 

where the summation is over all z such that y — z is a non-negative integer. Summing over v w 
and also over s, we obtain from a straightforward computation that 

" " „3/2 3s/2n 3s/2n 



S = l D~s1i) 



<2/e-""^^— ^ + ye-°"^ ^ _____ < ye'^-^ 

s=l s=n/2 



Recalling (j49p and that an = CLn/n, we complete the proof on the lower bound. □ 



4 Maxima of the Gaussian free field 

This section is devoted to the study of the maxima of the GFF, for which we will harvest results 
from previous sections. 

4.1 Physical locations for large values in Gaussian free field 

This subsection is devoted to the proof of Theorem II. li We first briefly explain the strategy for 
the proof. Suppose that there exists a number e > such that the limiting probability in ([1]) is 
larger than e along a subsequence {vk}. Then, we can take N' x N/e such that the same limiting 
probability with N replaced by A^' will approach almost 1. This would then (roughly) imply that 
the expected value of rj'^, will exceed 2rn,7v — log log — 0(1) (where 5 > is a small number), 
contradicting with Corollary 13.61 as k ^ oo. The details of the proof are carried out in what follows. 
We start with the following preliminary lemma. 

Lemma 4.1. For N' > 8A^, consider a discrete ball B of radius 8N in a box V/v of side length N' . 
Let B* d B be a box of side length N such that the centers of B and B* coincide. Let {rj^ : v G Vtv'} 
be a GFF on V^' with Dirichlet boundary condition and let 

-01, = E(ry^ I : u G dB}) . 

Then for v e B* , we have Var V't, = 0(log(iV7iV)). 

Proof. We need the following lemma, which implies that the harmonic measure on dB with respect 
to any v £ B* is comparable to the uniform distribution. 

Lemma 4.2. f35[ Lemma 6.3.7] Let €„, C be a discrete ball of radius n centered at the origin. 
There exist absolute constants c, C > such that for all x £ C„/4 and y G dCn 

c/n^F^{T9c„=y) ^C/n. 
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The Gauss-Markov property of the GFF allows one to write the conditional expectation for GFF 
at a vertex given values on the boundary as a harmonic mean for the values over the boundary (see 
e-g- |27t Theorem 1.2.2]). Combined with the preceding lemma, this implies that for v £ B* C B, 
we have 

^py = a„,^r/u;, where c/N ^ a„,^ ^ C/N . (52) 

wedB 

Therefore, we have 

VarV„ = e(l/iV2) J2 Gdv^,{u,w). (53) 

u,w&dB 

In order to estimate the sum of Green functions, we use the next lemma. 
Lemma 4.3. J3E[ Prop. 6.4-1] For i < n and x G C„ \ Ce, we have 

log|x|-log^ + 0(l/f) 

^x[Tacn < T^dCi) = — 7 • 

log n — log t 

By the preceding lemma, we have 

^u{Tav^, < T+s) ^ 0{l/{N\og{N'/N))) for all n G 9i? , 

where r^^ = min{t ^ 1 : G dB}. Thus, E«,eai? ^ay^, (n, w) = 0(iV log (iV'/iV)). Therefore, 

Var(V',,) = 0{\og{N'/N)) , for all v £ B* . □ 

The following lemma, using the sprinkling idea, is the key to the proof of Theorem 11.11 In the 
lemma, for e, 5 > we set C{6,s) = 21og5/log(l — e). 

Lemma 4.4. There exist a a constant C > such that, if 

¥(3v,u G Vn : r ^ \v — u\ ^ N/r and riu,r]y ^ — A) ^ e (54) 

for some e, A > and N,r then for any 5 > 0, setting N' to be the smallest power of 2 larger 
than or equal to C{6,e)N and 7 = C(y^log C((5, £)/S), the following holds 

F{r]%,^^ ^ 2mN - 2A - 7) ^ 1 - 5 . 

Proof Let N' = N2''+^ with k = [logg C{6, e) - 3]. Bi, . . . ,B2k C Vn' be disjoint discrete balls of 
radius 8A^, and for i G [2^] let B^ C i?j be a box of side length N such that these two centers (of the 
ball and the box) coincide. Let {rj'^ : v G Vtv'} be a GFF on VV' with Dirichlet boundary condition, 
and for i G [2^^] let {rj^^ : v G Bi} be i.i.d. GFFs on Bi with Dirichlet boundary condition. We first 
claim that for all i G [2^^] 

¥{3v, ueB* -.r i^\v-u\i^ N/r and r/^ + r/W ^ 2mN - 2A) ^ e/2 . (55) 

In order to prove the preceding inequality, we consider the decomposition of {r}h^ : v ^ B^} (by 
conditioning on the values at dB* analogous to (fTHl) ) as 

= r?«'* + (P^ for ah v G B* 
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where {rjv * : v € B*} is a GFF on B* with Dirichlet boundary condition and is independent of the 
centered Gaussian process : v E B*}. Note that cpy here denotes the conditional expectation of 
ijI given the values on OB* . Let Ti(i),T2{i) £ B* be the locations of maximizers of 

max{r/^*^'* + r/^*-*'* : u,v £ B* ,r ^ \v — u\ ^ N/r} . 

By Assumption ([5^ . we have 

Since </>ri(i) + 0r2(«) is a centered Gaussian variable that is independent of Vr^l^i) + ^r2^(i)' 
deduce ([55]) as required. 

Let us now consider the decomposition for {r/^ : v G Vj\f'}. We can write 

= Vv^ + for V G and z G [2^'] , 

where {V'd : f G -B*} is a Gaussian process independent of {7]^^ : i G [2'^],^; G -Bj}, and furthermore 

V^, = E(7?; I : G aSj) , for G B* . 

By Lemma |4.H we obtain that Var^^ = 0{k) for all v € B* and i G [2*^]. 
Next, let t G [2*^] be the location of the maximizer of 

By the independence of {t]^^^} for i G [2*^], we deduce that 

nv%) + V%) > '^ruM - 2A) ^ 1 - (1 - e/2f . 
Conditioning on the location of l and ri(i-), r2(t), we see that Var(-(/'T-i(i) +iPt2{l)) — 0{k). Therefore, 

]P«W + i^W ^ 2m;v - 2A - 7) ^ (1 - (1 - e/2)2')(l - ^) , 

where we simply used Markov's inequality to bound the probability P('!/'7-i(t) + V'r2(t) ^ ~^)- With 
our choice of fc, 7, this completes the proof. □ 

We next bound the lower tail on rf^ ^, from above. To this end, we first show that the maximal 
sum over pairs for the GFF has fluctuation at most O(loglogr). 

Lemma 4.5. For any r ^ N , letrf^^ he defined as in ()14p . Then the sequence of random variables 
{{Vn j. - E?7^ ,.)/loglogr}Ar,r is tight along iV G N and r G {0, . . . , iV}. 

Proof. To simplify notation, we consider the sequence = 2" in the proof (the tightness of the full 
sequence will follow from the same proof with slight modification by considering n{N) = max{A; G 
N : 2'' ^ A^}). To this end, we first claim that 

^'n2N,r ^ lEmaxjZi, Z2} , (56) 
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where ^1,^2 ~ "Hn r ^^'^ ^1 independent of Z2. The proof of (j56p follows from the similar 
argument as in the proof of LemmaEU as we sketch briefly in what follows. Consider V/v, 
where Vn and are two disjoint boxes of side length A^. Using a similar decomposition as in 
(fTHl) . we can write r/^^ = rj^ + for v S Va^ and r/^^ = 5?^ + 4>v for f G V^, where i]!^ and 57/^ 
are two independent copies of GFF in a 2D box of side length A^. This yields (j56p . Now using the 
equality a V 5 = ^ -^^g deduce that 

E|Zi -Z2I ^ 2(E??^jv,r -EZi) ^ 2C71oglogr, 

where the last inequality follows from Corollary 13.61 This completes the proof of the lemma. □ 

Based on the preceding lemma, we prove a stronger result which will also imply that the number 
of point whose values in the GFF exceed tjin — X grows at least exponentially in A. We will follow 
the proof for the upper bound on the lower tail of the maximum of GFF in |23l Sec. 2.4]. For 
TV, r E N, define 

^Af,r = {{u, v) £ Vn X Vn ■ r ^ \u - v\ ^ N/r} . 
Lemma 4.6. There exists absolute constants C, c > such that for all N & N and r,X ^ C 

F{3A C EN,r with \A\ ^ logr : V(n,u) e A : t]u + Vv ^ '^ruN - 2Aloglogr) ^ 1 - (^e"^'^ '"^ '"'^ " . 

Proof. The proof idea is similar to [23j, and thus we will be brief in what follows. Denote by 
R = N(logr)^^^^^ and i = N{logr)^'^^^^^ . Assume that the left bottom corner of Vn is the origin 
o = (0,0). Define a = {ii,2R) for 1 ^ i ^ M = [N/2l\ = (log r)^/i°°/2- Let d be a discrete 
ball of radius r centered at Oj and let Bi C Cj be a box of side length iZ/8 centered at Oj. We next 
regroup the M boxes into m blocks. Let m = (log r)'^/^^'^, and let £j = {Cj : {j — l)m < i < jm} 
and Bj = {Bi : (j — l)m < i < jm} for j = 1, . . . , Al/m. 

Now we consider the maximal sum over pairs of the GFF in each Bj. For ease of notation, we 
fix j = 1 and write B = Bi and € = €1. For each B £ B, analogous to (fT6]l . we can write 

Vv = + (Pv ^or Sill V £ B QC e (t, 

where {g^ : v G B} is the projection of the GFF on C with Dirichlet boundary condition on 
dC, and {{g^ : v G B} : B £ B} are independent of each other and of {rj^ : v G d€}, and 
= lE(r/^ I {r]u '■ u G d^}) is a convex combination of : u G SC}. For every B £ B, define 
ixi,B,X2,B) £ B X Bn EN,r such that 

9i,XB +9x2B= 9v +9u ■ 

Since A is large enough, we get from Corollary 13.61 and Lemma 14.51 that 

^(9?,^B + 5x2,fl ^ 2m;v - A log log r) ^ 1/4 . 

Let W = {{xi,b,X2,b) '■ 9ixB ~^ 3x2 b ^ 2mAr — Aloglogr, S G B}. By independence, a standard 
concentration argument gives that for an absolute constant c > 

¥{W ^ |m) ^ e-^" . (57) 

It remains to study the process {(j)u + 4>v '■ (u, v) G W}. li (j)u + (pv ^0 for (n, v) £ W , we have 
Vu + Vv ^ 2m N — A log logr. The required estimate is summarized in the following lemma. 
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Lemma 4.7. |13 Lemma 2.3] Let U C LIb^bB x B such that \U n B x B\ ^ I for all B e B. 
Assume that \U\ ^ m/8. Then, for some absolute constants C,c> 

H(pu + 0^ ^ for all {u,v) £ U) Ce'^^^"^''^'^ . 

Despite the fact that we are considering a sum over a pairs (instead of a single value (p^) in 
the current setting as well as slightly different choices of parameters, the proof of the preceding 
lemma goes exactly the same as that in [23j. The main idea is to control the correlations among 
{(pu + <pv) for (u, v) G U. Indeed, one can show that the correlation coefficient is uniformly bounded 
by 0(A log log r R/i) ■ Slepian's comparison theorem can then be invoked to complete the proof. 
Due to the similarity, we do not reproduce the proof here. 

Altogether, the preceding lemma implies that 

P(maxBge ^a^v,veBxBnSN,r'nu + Vv ^ 2mAr - 2Aloglogr) ^ 1 - C'e"'^^'"^'")"^ . 
Now, let (xi,j)X2,j) G Bj X Bj n Hat,, be such that 

Vxi , + 11x2 i = ™ax max r]u + 'ny, 

and let A = {{xi,j,X2,j) : 1 ^ i ^ M/m}. A union bound gives that min(-„_„)g^ r/^ + rjy 
2m]\f — 2Aloglogr with probability at least 1 — (jQ-ciiogr)"^ ^ concluding the proof. □ 

The following is an immediate corollary of the preceding lemma. 

Corollary 4.8. There exist absolute constants C, c > such that for all N gN and X,r ^ C 

P(7?^,, ^ 2mN - 2Aloglogr) ^ 1 - ^e""'''""" . 

We are now ready to give 
Proof of Theorem 11.11 Suppose that the conclusion in the theorem does not hold. This implies 
that for a particular choice of c = ci/8 (where ci is the constant in Corollary 13. 6p there exists e > 
and a subsequence {r^} with — )-fc^oo oo such that for all k 

limsupP(3ti, n G Vn : rk ^ \v — u\ ^ N/r^ and ??u,r?t, ^ niN — cloglogr^) ^ e . 
Then by Lemma 14.41 for a 5 > to be specified and C{e, 5) > 0, we have 

limsupP(3u,u G Vc(£^s)N ■ ^ l^^-^^l ^ C{e,5)N/rk ,'nu+r]v ^ 2mAr-2clog log rA,.-C(e, 5)) ^ 1-5. 

N-^oo 

Now we consider random variables W]y,k = 2mN — 2c log log — C{e,5) — ffc(e6)Nri,' 
preceding inequality, for any 5 > there exists an integer A'^^ such that P(VFAr,A,. ^ 0) ^ 25 for all 
N ^ Ns and A; G N. By Corollarv 14.81 we see that for absolute constants C*,c* > 

HWN,k ^ Aloglogrfe) ^ c'*e-c*e=*{^-2c)iogiog.,^ N,k,\^C\ 
Therefore, for N ^ Ns and ^ e'^ V C*, we obtain that 

foo poo ^ 

¥WN,k ^ loglogrfc / nWN,k > Aloglogrfc)(iA ^ log log / (25) A (C'^e''^*^^ (A-2c),ogiog.,^^^ 

JO JO 

^^cC^-Jloglogrfc + loglogrfc / (25) A (C^e""*"' ^)dA ^ Ac,c*,c*<51oglogrfc , 

Jo 



23 



where Ac^c* > is a number depending on (c, C*) and Ac^c*,c* > is a number that depends only 
on (c, C* , c*). Recalling that c = ci/8 and choosing 5 = ci / AAc^c* ,c* , we then get that for N Ns 
and rfc ^ e'^, 

^Vcie,S)N,,r, ^ '^^N, " y log log Tfc - C7(e, 5) for ah A: G N . (58) 
This contradicts with Corollary 13.61 (sending A; — )■ oo), thereby completing the proof. □ 

We conclude this subsection by providing 
Proof of Theorem II. 2L The lower bound on ^Ia.a^ follows immediately from Lemma 14.61 A 
straightforward deduction from Theorem 11.11 together with a packing argument yields an upper 
bound of merely doubly-exponential on ^a,7V- In what follows, we strengthen the upper bound to 
exponential of A. Continue denoting Si^n ^-nd TZ^^n ^ in Lemma 12.91 Following notation as in 
Section 13.11 we see that 

7^,,^ < iTj, - 4^1{|H^,iog^/(2c*)l ^ Wlog^, 

where H^^. = Ui=tjv-Tjv '^^(^)- ^PPlyi^g ([^^ and ([2^ . we deduce that there exists a constant 
c > such that for sufficiently large I 

mZii^N ^ ^(A/21og2/7rmAr - clog£) . 

Combined with Lemma l2.9t it follows that for sufficiently large ^ 

E5^,iv ^ ^(V21og2/7rm^ - clog^) . (59) 

At this point, the proof can be completed analogous to the deduction of (j58p . as we sketch below. 
Suppose otherwise that for any a > there exists a subsequence {vk} such that for all k there 
exists a subsequence N^^i with 

n\AN,,,r,\ ^e'^)^e, for aUiGN, 

where e > is a positive constant. Then, following the same sprinkling idea in Lemma 14.41 we can 
show that for any (5 > 0, there exists C(e,(5) such that for A^^ ^ = C{5,e)Ni^^i and 7 = 7(5, 5), the 
following holds 

P(|^;v^^^,.,_,|^e-'=)^l-5. 
Combined with Lemma 14.61 it follows that 

ESe<^r^^^,^ ^ ^ e"*"* (A/21og2/7rmjv^ , ~ + c''^a)?'fc - 7) > 

where c' > is a constant that arise from the estimate in Lemma 14.61 Now, setting 5 = {c/2d), 
a = A/c and sending — )• 00, we obtain a bound that contradicting with ()59p . completing the 
proof of Theorem 11.21 

4.2 The gap between the largest two values in Gaussian free field 

In this subsection, we study the gap between the largest two values and prove Theorem 11.31 
Upper bound on the right tail. In order to show the upper bound in (l2|), it suffices to prove 
that for some absolute constants C, c > and all A > 

P(A < Ftv ^ A + 1) ^ P(riv ^ 1) • Ce~^^' . (60) 
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To this end, define 

^^A = {{x^,)^^Vn ■ liixv)) e (A, A + 1]} for all A ^ , 

where 7((xt,)) is defined to be the gap between the largest two values in {x^}. For (x^)^gvw £ f^A, 
let r G Vat be such that Xr = max^gy^^ x^. We construct a mapping (px '■ ^ that maps 
{xv)yfzvr^ G to {yv)v&VN such that 

Uv = Xy ii V ^ T , and y-r = x-r — \ ■ 

It is clear that the mapping is 1-1 and {yy)v<^VN ^ ^o- Furthermore, the Jacobian of the mapping 
is precisely 1 onVLx- It remains to estimate the density ratio f{{xv))/f{{yi,)). Using Q, we get 
that 

f(^(^Xy)) = Ze~i^ ^"-''^^""'^"^^ = Ze"3^^"~"^^"~^"^^e~^^''~^^^''""^^^^~^^''~^^^^^ ^ f{{yv))e~^^^ ■ 
It then follows that 

¥{{ri:f)enx)^e-''/M{v^)^^o), 

completing the proof of (fGOl) . 

Lower bound on the right tail. In order to prove the lower bound on the right tail for the gap, 
we first show that with positive probability there exists a vertex such that all its neighbors in the 
GFF take values close to niN within a constant window. To this end, we consider a new Gaussian 
process {(y : v G Vn} defined by 

Cv = iT.ur^v'nv for v(^Vn\ BYn , and C\dV = . (61) 

In addition, we denote by and the collection of even and odd vertices in V/v, respectively. 
Note that Vj^ = V^yj V°. 

Lemma 4.9. For every e > 0, there exists a constant > such that 

P(maxt,gv« Ci; ^ "I AT - Ce) ^ I - e . 
Proof. We will apply Lemma [2^21 For k G N to be specified later, define (/)«;(•) : ^2-«Af ^ 

(t>K{v) = , for all v G V2~iiN ■ 
Let {r/^ : t> G V2-k]^} be a GFF on 1^2-'= Af- We claim that for large k (independent of A^) 

E(C^4„) - C^4.))' ^ nvr^" - V^ f, for ah u,v G V^-.^ ■ (62) 
In order to see this, we note that by ^ and the triangle inequality 

Vaic{r]y) = Vavrju + 0(1) = Cov(r/„,r/„) + 0(1) , for all n ~ t; . 
This then implies that 

IE(C^ - Cu? = - Vv? + 0(1) , for ah u,veVN. 
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Now again using the fact that 

grows with k, we could select k, large (though independent of N) to beat the 0(1) term, and thus 
obtain (162p . At this point, an application of Lemma 12.21 and (jl} yields that 

EmaxC« = m7v + 0(l). (63) 

In addition, it is clear that maxt,gv^ (y ^ maxtjgvv Vv- Therefore, ([5]) implies an exponential right 
tail for maxtjgv^ C^,. Together with ()63p . this completes the proof of the lemma. □ 

For e > 0, let be defined as in the preceding lemma, and define 

For (xy) £ Cle, let v* £ be such that lJ2ur^v*^u = "oiaXy^v^l^^^^Xu- Let = {(x„) G 0^ : 
^v* — jYlur^v*^u ^ (~C'*,0)}. Note that {C^, : v G V^} is measurable in the u-field generated by 
{rjy : V G V^}. Applying Markov field property of the GFF and Lemma 14.91 we obtain that there 
exists C* sufficiently large (depending only on e) such that P( (?/?;) € fi*) ^ 1 — 2e. By ([5]), we see 
that there exists a constant > (depending only on e) such that 

P({??4G5^e)^l-3e, (64) 

where = n {max^gy^ i^^^^x^ ^ max^igVAr ^^t, — C^}. Now choose e = 1/4. For A ^ 0, 
define a map ^'a : ^y^^ ^ by ^x{{xv)) = {Vv) with 

= Xt, for all v ^ v* ^ and y^,* = 2max^Xt, + A — x^* . 

The somewhat strange definition of y^* above (as opposed to set y„* = max^, Xy + A) is for the 
purpose of ensuring the mapping to be bijective. By definition, we have that 

x{{xv))) = 2 max Xy + \ — Xy* — maxx^, ^ A , 

V V 

for all (xy) £ ^1/4- It is also obvious that ^\ is a bijective mapping and that the determinant of 
the Jacobian is 1. In addition, it is straightforward to check (by definition of fi*) for some absolute 
constants c*, C"^ > 

fi^xiixv))) > c%-^''^'f{{xy)) , for ah [xy] e . 

Integrating over and applying we complete the proof for the lower bound in ([2]). 

Lower bound on the gap. For any e > 0, we let and v* be defined as above such that (j64p 
holds. Denote by r the maximizer of maXy^vN By Theorem 11.21 there exists C* > such that 
^ 1 - 4e, where 

n^e = ^tr\ {(xv) : \{v: Xy ^ Xr' - C; - 1}| ^ Ct} . 
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Consider < 5 < 1, and define Cj = {{x^) ■ {i — l)6 ^ 7((2;„)) < i5} for all i ^ 1. We then construct 
a mapping <I>j : $7* n Ci i— by (say maps to {uv)) defining 

Uy = Xv V ^ {v* , t} , and y^* = Xr + i6 , 

and in addition = x^,* if v* / r. For all (x^) G rig and i = 1, ... ,1/5, it is clear that 

/((x,))<C^/(cI.,,((x,))), 

where C'^ is a constant that depends on e. In addition, for all G il*nCi we see that ^i{{xy)) G 
Cj+i. Furthermore, every image has at most C* + 1 pre-images in fi* nCi. In order to see this, 
we note that there are two cases when trying to reconstruct (xy) from (yy): (1) v* = r, in which 
we obtain one valid instance of (xy); (2) v* ^ r, in which we obtain at most C* valid instances 
of (xy). This is because by definition v* is the maximizer of max^gVjv Vv', and r satisfies that 
yr = Xy* Xr — C* , and there are at most C* locations whose values in y. is no less than Xr — C*. 
Once we locate v* and r, the sequence {xy) is uniquely determined by {yy). Altogether, we obtain 

¥{{r,y) G 0* n Ci) ^ cue: + iMiVv) G Ci+i) 

for all 1 ^ z ^ 1/5. Since Cj's are disjoint, we obtain that 

Flirty) entr^c,)^2CUc: + 1)5. 

Now, sending 5 — )• and then e — )• completes the proof of ([3]). 
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